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Abstract 

Magnetostatic fields in accelerators are conventionally described in terms of 
multipoles. We show that in two dimensions, multipole fields do provide so- 
lutions of Maxwell's equations, and we consider the distributions of electric 
currents and geometries of ferromagnetic materials required (in idealized sit- 
uations) to generate specified multipole fields. Then, we consider how to de- 
termine the multipole components in a given field. Finally, we show how the 
two-dimensional multipole description may be extended to three dimensions; 
this allows fringe fields, or the main fields in such devices as undulators and 
wigglers, to be expressed in terms of a set of modes, where each mode provides 
a solution to Maxwell's equations. 

1 Maxwell's equations 

Maxwell's equations may be written in differential form as follows: 

div5 = p, (1) 
divS = 0, (2) 
^ dD 

cm\H = J+^, (3) 
at 

dB 

cuxlE = (4) 

The fields B (magnetic flux density) and E (electric field strength) determine the force on a particle of 
charge q travelling with velocity v (the Lorentz force equation): 



F = q(^E + vx 



The electric displacement D and magnetic intensity H are related to the electric field and magnetic flux 
density by: 

D = sE, 
B = ^H. 

The electric permittivity e and magnetic permeability fi depend on the medium within which the fields 
exist. The values of these quantities in vacuum are fundamental physical constants. In SI units: 

/xo = 47r X 10"'^Hm"\ 
1 

^0 = 2' 

where c is the speed of light in vacuum. The permittivity and permeability of a material characterize the 
response of that material to electric and magnetic fields. In simplified models, they are often regarded 
as constants for a given material; however, in reality the permittivity and permeability can have a com- 
plicated dependence on the fields that are present. Note that the relative permittivity Sr and the relative 
permeability fir are frequently used. These are dimensionless quantities, defined by: 



That is, the relative permittivity is the permittivity of a material relative to the permittivity of free space, 
and similarly for the relative permeability. 

The quantities p and J are respectively the electric charge density (charge per unit volume) and 
electric current density ( J • n is the charge crossing unit area perpendicular to unit vector n per unit time). 
Equations ^ and Q are independent of p and J, and are generally referred to as the "homogeneous" 
equations; the other two equations, jl]) and Q are dependent on p and J, and are generally referred to as 
the "inhomogeneous" equations. The charge density and current density may be regarded as sources of 
electromagnetic fields. When the charge density and current density are specified (as functions of space, 
and, generally, time), one can integrate Maxwell's equations Q-Q to find possible electric and magnetic 
fields in the system. Usually, however, the solution one finds by integration is not unique: for example, 
the field within an accelerator dipole magnet may be modified by propagating an electromagnetic wave 
through the magnet. However, by imposing certain constraints (for example, that the fields within a 
magnet are independent of time) it is possible to obtain a unique solution for the fields in a given system 
of electric charges and currents. 

Most realistic situations are sufficiently complicated that solutions to Maxwell's equations cannot 
be obtained analytically. A variety of computer codes exist to provide solutions numerically, once the 
charges, currents, and properties of the materials present are all specified, see for example References |[T]- 
[3|. Solving for the fields in realistic (three-dimensional) systems often requires a reasonable amount of 
computing power; some sophisticated techniques have been developed for solving Maxwell's equations 
numerically with good efficiency ||4j. We do not consider such techniques here, but focus instead on 
the analytical solutions that may be obtained in idealized situations. Although the solutions in such 
cases may not be sufficiently accurate to complete the design of a real accelerator magnet, the analytical 
solutions do provide a useful basis for describing the fields in real magnets, and provide also some 
important connections with the beam dynamics in an accelerator. 

An important feature of Maxwell's equations is that, for systems containing materials with con- 
stant permittivity and permeability (i.e. permittivity and permeability that are independent of the fields 
present), the equations are linear in the fields and sources. That is, each term in the equations involves 
a field or a source to (at most) the first power, and products of fields or sources do not appear. As a 
consequence, the principle of superposition applies: if Bi and §2 are solutions of Maxwell's equations 
with the current densities Ji and J2, then the field Bt = Bi-\- B2 will be a solution of Maxwell's equa- 
tions, with the source given by the total current density Jt = Ji + J2- This means that it is possible to 
represent complicated fields as superpositions of simpler fields. An important and widely used analysis 
technique for accelerator magnets is to decompose the field (determined from either a magnetic model, 
or from measurements of the field in an actual magnet) into a set of multipoles. While it is often the ideal 
to produce a field consisting of a single multipole component, this is never perfectly achieved in practice: 
the multipole decomposition indicates the extent to which components other than the "desired" multipole 
are present. Multipole decompositions also produce useful information for modelling the beam dynam- 
ics. Although the principle of superposition strictly only applies in systems where the permittivity and 
permeability are independent of the fields, it is always possible to perform a multipole decomposition of 
the fields in free space (e.g. in the interior of a vacuum chamber), since in that region the permittivity 
and permeability are constants. However, it should be remembered that for nonlinear materials (where 
the permeability, for example, depends on the magnetic field strength), the field inside the material com- 
prising the magnet will not necessarily be that expected if one were simply to add together the fields 
corresponding to the multipole components. 

Solutions to Maxwell's equations lead to a rich diversity of phenomena, including the fields around 
charges and currents in certain simple configurations, and the generation, transmission and absorption of 
electromagnetic radiation. Many existing texts cover these phenomena in detail; see, for example, the 
authoritative text by Jackson |5|. Therefore, we consider only briefly the electric field around a point 
charge and the magnetic field around a long straight wire carrying a uniform current: our main purpose 
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here is to remind the reader of two important integral theorems (Gauss' theorem, and Stokes' theorem), 
of which we shall make use later. In the following sections, we will discuss analytical solutions to 
Maxwell's equations for situations relevant to some of the types of magnets commonly used in acceler- 
ators. These include multipoles (dipoles, quadrupoles, sextupoles, and so on), solenoids, and insertion 
devices (undulators and wigglers). We consider only static fields. We begin with two-dimensional fields, 
that is fields that are independent of one coordinate (generally, the coordinate representing the direction 
of motion of the beam). We will show that multipole fields are indeed solutions of Maxwell's equations, 
and we will derive the current distributions needed to generate "pure" multipole fields. We then discuss 
multipole decompositions, and compare techniques for determining the multipole components present in 
a given field from numerical field data (from a model, or from measurements). Finally, we consider how 
the two-dimensional multipole decomposition may be extended to three-dimensional fields, to include 
(for example) insertion devices, and fringe fields in multipole magnets. 

2 Integral theorems and the physical interpretation of Maxwell's equations 
2.1 Gauss' theorem and Coulomb's law 

Guass' theorem states that for any smooth vector field a: 

/ diwadV^ [ a-dS, 

Jv JdV 

where y is a volume bounded by the closed surface dV. Note that the area element dS is oriented to 
point out of V. 

Gauss' theorem is helpful for obtaining physical interpretations of two of Maxwell's equations, 
([T]) and (|2]). First, applying Gauss' theorem to ([!]) gives: 

/ diw DdV= [ D-dS^q, (6) 

Jv JdV 

where q = Jy p dV is the total charge enclosed by dV. 

Suppose that we have a single isolated point charge in an homogeneous, isotropic medium with 
constant permittivity e. In this case, it is interesting to take dV to be a sphere of radius r. By symmetry, 
the magnitude of the electric field must be the same at all points on dV, and must be normal to the 
surface at each point. Then, we can perform the surface integral in (|6]): 

/ D-dS^Anr^D. 

JdV 

This is illustrated in Fig.[TJ the outer circle represents a cross-section of a sphere (dV) enclosing volume 
V, with the charge q at its centre. The black arrows in Fig.|T|represent the electric field lines, which are 
everywhere perpendicular to the surface dV. Since D = eE, we find Coulomb's law for the magnitude 
of the electric field around a point charge: 

Applied to Maxwell's equation Q, Gauss' theorem leads to: 

/ diw BdV= [ B-dS^O. 

Jv JdV 

In other words, the magnetic flux integrated over any closed surface must equal zero - at least, until we 
discover magnetic monopoles. Lines of magnetic flux occur in closed loops; whereas lines of electric 
field can start (and end) on electric charges. 
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Fig. 1: Electric field lines from a point charge q. The field lines are everywhere perpendicular to a spherical surface 
centered on the charge. 



2.2 Stokes' theorem and Ampere's law 

Stokes' theorem states that for any smooth vector field a: 

/ curl a • dS = a • dl, (7) 

Js JdS 

where the loop dS bounds the surface S. Applied to Maxwell's equation ([3]), Stokes' theorem leads to: 

/ H-dl= [ J-dS, (8) 
JdS Js 

which is Ampere's law. From Ampere's law, we can derive an expression for the strength of the magnetic 
field around a long, straight wire carrying current /. The magnetic field must have rotational symmetry 
around the wire. There are two possibilities: a radial field, or a field consisting of closed concentric 
loops centred on the wire (or some superposition of these fields). A radial field would violate Maxwell's 
equation Q. Therefore, the field must consist of closed concentric loops; and by considering a circular 
loop of radius r, we can perform the integral in Eq. ([8]): 

where / is the total current carried by the wire. In this case, the line integral is performed around a loop 
dS centered on the wire, and in a plane perpendicular to the wire: essentially, this corresponds to one of 
the magnetic field lines, see Fig.|2j The total current passing through the surface S bounded by the loop 
dS is simply the total current /. 

In an homogeneous, isotropic medium with constant permeability fj., B = fJ^oH, and we obtain the 
expression for the magnetic flux density at distance r from the wire: 

27T/j.r 

This result will be useful when we come to consider how to generate specified multipole fields from 
current distributions. 

Finally, applying Stokes' theorem to the homogeneous Maxwell's equation Q, we find: 

/ E-dt= [ B-dS. (10) 
JdS dt Js 
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Fig. 2: Magnetic field fines around a long straight wire carrying a current /. 



Defining the electromotive force S' as the integral of the electric field around a closed loop, and the 



magnetic flux $ as the integral of the magnetic flux density over the surface bounded by the loop, Eq. ([TO]) 
gives: 

(11) 



which is Faraday's law of electromagnetic induction. Faraday's law is significant for magnets with 
time-dependent fields, such as pulsed magnets (used for injection and extraction), and magnets that are 
"ramped" (for example, when changing the beam energy in a storage ring). The change in magnetic 
field will induce a voltage across the coil of the magnet, that must be taken into account when designing 
the power supply. Also, the induced voltages can induce eddy currents in the core of the magnet, or in 
the coils themselves, leading to heating. This is an issue for superconducting magnets, which must be 
ramped slowly to avoid quenching |6|. 



2.3 Boundary conditions 

Gauss' theorem and Stokes' theorem can be applied to Maxwell's equations to derive constraints on 
the behaviour of electromagnetic fields at boundaries between different materials. Here, we shall focus 
on the boundary conditions on the magnetic field: these conditions will be useful when we consider 
multipole fields in iron-dominated magnets. 

Consider first a short cylinder or "pill box" that crosses the boundary between two media, with the 
flat ends of the cylinder parallel to the boundary, see Fig.|3](a). Applying Gauss' theorem to Maxwell's 
equation (|2]) gives: 

/ divSrfl/= / B-dS^O, 

Jv JdV 

where the boundary dV encloses the volume V within the cylinder. If we take the limit where the length 
of the cylinder (2h) approaches zero, then the only contributions to the surface integral come from the 
flat ends; if these have infinitesimal area dS, then since the orientations of these surfaces are in opposite 
directions on opposite sides of the boundary, and parallel to the normal component of the magnetic field, 
we find: 

- Bi^ dS + B2± dS = 0, 

where Bi± and B2± are the normal components of the magnetic flux density on either side of the bound- 
ary. Hence: 

Si± = S2±. (12) 
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Fig. 3: (a) Left: "Pill box" surface for derivation of the boundary conditions on the normal component of the 
magnetic flux density at the interface between two media, (b) Right: Geometry for derivation of the boundary 
conditions on the tangential component of the magnetic intensity at the interface between two media. 



In other words, the normal component of the magnetic flux density is continuous across a boundary. 

A second boundary condition, this time on the component of the magnetic field parallel to a bound- 
ary, can be obtained by applying Stokes' theorem to Maxwell's equation ([3]). In particular, we consider a 
surface S bounded by a loop dS that crosses the boundary of the material, see Fig.|3](b). If we integrate 
both sides of Eq. ([3]) over that surface, and apply Stokes' theorem Q, we find: 

/ curlH -dS^ [ H-dl^I^^ [ D-dS, 
Js JdS ot Js 

where / is the total current flowing through the surface S. Now, let the surface S take the form of a thin 
strip, with the short ends perpendicular to the boundary, and the long ends parallel to the boundary. In 
the limit that the length of the short ends goes to zero, the area of S goes to zero: both the current flowing 
through the surface S, and the electric displacement integrated over S become zero. However, there are 
still contributions to the integral of H around dS from the long sides of the strip. Thus, we find that: 

^i|| =^2||, (13) 

where Hi\\ is the component of the magnetic intensity parallel to the boundary at a point on one side of 
the boundary, and H2\\ is the component of the magnetic intensity parallel to the boundary at a nearby 
point on the other side of the boundary. In other words, the tangential component of the magnetic 
intensity H is continuous across a boundary. 

We can derive a stronger contraint on the magnetic field at a boundary in the case that the material 
on one side of the boundary has infinite permeability (which can provide a reasonable model for some 



ferromagnetic materials). Since B = it follows from ( |13| ) that: 

-Sill _ 
111 112 ' 

and in the limit 112 oc, while fii remains finite, we must have: 

Sill = 0. (14) 
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In other words, the magnetic flux density at the surface of a material of infinite permeability must be 
perpendicular to that surface. Of course, the permeability of a material characterizes its response to 
an applied external magnetic field: in the case that the permeability is infinite, a material placed in an 
external magnetic field acquires a magnetization that exactly cancels any component of the external field 
at the surface of the material. 



3 Two-dimensional multipole fields 

Consider a region of space free of charges and currents; for example, the interior of an accelerator 
vacuum chamber (at least, in an ideal case, and when the beam is not present). If we further exclude 
propagating electromagnetic waves, then any magnetic field generated by steady currents outside the 
vacuum chamber must satisfy: 



diwB = 
curlS = 



0, 
0. 



(15) 
(16) 



Eq. ( [T5) ) is just Maxwell's equation ([2]), and Eq. ( [T6| ) follows from Maxwell's equation ([3]) given that 
J = 0, B = jj^qH, and derivatives with respect to time vanish. 

We shall show that a magnetic field B = {Bx^ By^ Bz) with Bz constant, and Bx, By given by: 



By + iBx = Cn{x + iy) 



n-l 



(17) 



where i = v — 1 and Cn is a (complex) constant, satisfies Eqs. ( |15| ) and ( |16| ). Note that the field com- 
ponents Bx and By are real, and are obtained from the imaginary and real parts of the right hand side of 
Eq. ( [TT] ). To show that the above field satisfies Eqs. ( [T5] ) and ( [T6| ), we apply the differential operator 



d_ .d_ 

dx dy 



to each side of Eq. ( 17 ). Applied to the left hand side, we find: 



+ i-Qy ) (By + iB, 



dx 



dB„ OB, 



dx 



dy 



+ i 



OB^ dB. 



dx 



+ 



dy 



(18) 



(19) 



Applied to the right hand side of Eq. ( 17 ), the differential operator (18) gives: 



d 



d 



^ + ) Cn{x + iy)^-' = Cn{n -l){x + iyf-' + i^C„(n - \){x + iyf-' = 0. (20) 



Combining Eqs. ([TTjl, ([T9]» and (|20]), we find: 

d_B^ 
dx 
dB„ 



+ 



dx 



dBy 

dy 
dy 



0, 
0. 



Finally, we note that Bz is constant, so any derivatives of Bz vanish; furthermore, Bx and By are inde- 
pendent of z, so any derivatives of these coordinates with respect to z vanish. Thus, we conclude that for 
the field 



divB 
cuvlB 



0, 
0, 



(21) 
(22) 



and that this field is therefore a solution to Maxwell's equations within the vacuum chamber. Of course, 
this analysis tells us only that the field is a possible physical field: it does not tell us how to generate such 
a field. The problem of generating a field of the form Eq. ( [T7| ) we shall consider in Section |4] 
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Fields of the form (Tf) are known as multipole fields. The index n (an integer) indicates the order 
of the multipole: n = 1 is a dipole field, n = 2 is a quadrupole field, n = 3 is a sextupole field, and so 
on. A solenoid field has = for all n, and non-zero; usually, a solenoid field is not considered 
a multipole field, and we assume (unless stated otherwise) that = in a multipole magnet. Note 
that we can apply the principle of superposition to deduce that a more general magnetic field can be 
constructed by adding together a set of multipole fields: 

oo 

By + iB, = J2 ^n{x + iyT-\ (23) 

n=l 

A "pure" multipole field of order n has ^ for only that one value of n. 

The coefficients Cn in Eg. [23] characterise the strength and orientation of each multipole compo- 
nent in a two-dimensional magnetic field. It is sometimes more convenient to express the field using 
polar coordinates, rather than Cartesian coordinates. Writing x = r cos 9 and = r sin 6, we see that 
Eq. ( [23] ) becomes: 

oo 
n=l 

By writing the multipole expansion in this form, we see immediately that the strength of the field in a 
pure multipole of order n varies as r^~^ with distance from the magnetic axis. We can go a stage further, 
and express the field in terms of polar components: 

By + iB^ = Br sin 0^ BqcosO^ iBr cos 9 - iBq sin = {Bq + iBr) e"'^, 

thus: 

oo 

Be ^iBr^Yl Cnr'^-^e''^^. (24) 

n=l 

By writing the field in this form, we see that for a pure multipole of order n, rotation of the magnet 
through n/n around the z axis simply changes the sign of the field. We also see that if we write: 

— \^n\ ^ 

then the value of 0^ (the phase of Cn) determines the orientation of the field. Conventionally, a pure 
multipole with 0^ = is known as a "normal" multipole, while a pure multipole with 0^ = 7r/2 is 
known as a "skew" multipole. 

The units of Cn depend on the order of the multipole. In SI units, for a dipole, the units of Ci are 
tesla (T); for a quadrupole, the units of C2 are Tm~^; for a sextupole, the units of C3 are Tm~^, and so 
on. It is sometimes preferred to specify multipole components in dimensionless units. In that case, we 
introduce a reference field, B^^f, and a reference radius, i?ref • The multipole expansion is then written: 



By + iB, = S,ef 5^ (a, + ibn) ( ) . (25) 



n-l 



1 \ ^ref 

n=l ^ 

This is a standard notation for multipole fields, see for example |7|. In polar coordinates: 

n-l 



By + iB, = S,ef Tian + ibn) e^("-'^'. 



n=l ^ ^ 



(26) 



The reference field and reference radius can be chosen arbitrarily, but must be specified if the coefficients 
Qn and bn are to be interpreted fully. 
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n= 1 



n= 1 




Fig. 4: "Pure" multipole fields. Top: dipole. Middle: quadrupole. Bottom: sextupole. Fields on the left are normal 
(a„ positive); those on the right are skew (6„ positive). The positive y axis is vertically up; the positive x axis is 
horizontal and to the right. 
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Note that for a pure multipole field of order n, the coefficients and bn are related to the derivates 
of the field components with respect to the x and y coordinates. Thus, for a normal multipole: 



— = (n - 1)! 



and for a skew multipole: 

079—1 T-) 

5ref 



(n-1)!- 



A normal dipole has a uniform vertical field; a normal quadrupole has a vertical field for ^ = 0, that 
increases linearly with x; a normal sextupole has a vertical field for 7/ = that increases as the square of 
x; and so on. 



4 Generating multipole fields 

Given a system of electric charges and currents, we can integrate Maxwell's equations to find the electric 
and magnetic fields generated by those charges and currents. In general, the integration must be done 
numerically; but for simple systems it is possible to find analytical solutions. We considered two such 
cases in Section|2} the electric field around an isolated point charge, and the magnetic field around a long 
straight wire carrying a constant current. 

It turns out that we can combine the magnetic fields from long, straight, parallel wires to generate 
pure multipole fields. It is also possible to generate pure multipole fields using high-permeability mate- 
rials with the appropriate geometry. We consider both methods in this section. For the moment, we deal 
with "idealised" geometries without practical constraints. We discuss the impact of some of the practical 
limitations in later sections. 



4.1 Current distribution for a multipole field 

Our goal is to determine a current distribution that will generate a pure multipole field of specified order. 
As a first step, we derive the multipole components in the field around a long straight wire carrying a 
uniform current. We already know, from Ampere's law (|9]) that the field at distance r from a long straight 
wire carrying current / in free space has magnitude given by: 

T 

B 



and that the direction of the field describes a circle centred on the wire. To derive the multipole compo- 
nents in the field, we first derive an expression for the field components at an arbitrary point (x, y) from 
a wire carrying current /, passing through a point (xq, yo) and parallel to the z axis. 

Since we are working in two dimensions, we can represent the components of a vector by the real 
and imaginary parts of a complex number. Thus, the vector from (xq, yo) io point (x, y) is given by 
re^^ — roe^^o, and the magnitude of the field at (x, y) is: 

B ' ' 



27V fio — Toe^^o I 

The geometry is shown in Fig.jsj The direction of the field is perpendicular to the line from (xq, yo) to 
(x, y). Since a rotation through 90° can be represented by a multiplication by z, we can write: 

/ i (re'^ - roe^^o) 
Bx + iBy — ■ — 2 ? 
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Fig. 5: Geometry for calculation of multipole components in the field around a long, straight wire carrying a 
uniform current. The wire passes through roe*^°, and is parallel to the z axis (the direction of the current is 
pointing out of the page). 



and hence: 

/ (re-^^ -roe-^^o) / 1 
Now, we write the magnetic field as: 

and use the Taylor series expansion for (1 — (^)~^, where C is a complex number with |(^| < 1: 



^ oo 



to write: 



R + iB, ^ ^ e-^^° y - e^(-i)(^-^o). (27) 



Eq. ([27]) is valid for r < tq. Comparing with the standard multipole expansion, Eq. ( [26| ), we see that if 
we choose for the reference field Sref and the reference radius i?ref • 

^ref = 



then the coefficients for the multipole components in the field are given by: 

The field around a long straight wire can be represented as an infinite sum over all multipoles. 
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Now we consider a current flowing on the surface of a cylinder of radius tq. Suppose that the 
current flowing in a section of the cylinder at angle Oq and subtending angle dOo at the origin is I{9o) dOo. 
By the principle of superposition, we can obtain the total field by summing the contributions from the 
currents at all values of ^o- 



We see that the multipole components are related to the Fourier components in the current distribution 
over the cylinder of radius tq. In particular, if we consider a current distribution with just a single Fourier 
component: 

/(^o) = /ocos(no0o-0), (29) 
the integral in the right hand side of Eq. ( [28] ) vanishes except for n = no, and we find: 

no-l 



By + ^B^ = --^ (^Y~ e^(-o-i)^,-.0. 



The current distribution ([29]) generates a pure multipole field of order no- If we choose, as before: 



27r/xo^o ' 



then the multipole coefficients are: 



bn + ian —Tve' 



-i(f) 



The parameter gives the "angle" of the current distribution. For = or = tt, the current generates 
a normal multipole; for cp = ±7r/2, the current generates a skew multipole. 

The fact that a sinusoidal current distribution on a cylinder can generate a pure multipole field is 
not simply of academic interest. By winding wires in an appropriate pattern on a cylinder, it is possible 
to approximate a sinusoidal current distribution closely enough to produce a multipole field of acceptable 
quality for many applications. Usually, several layers of windings are used with a different pattern of 
wires in each layer, to improve the approximation to a sinusoidal current distribution. Superconducting 
wires can be used to achieve strong fields: an example of superconducting quadrupoles in the LHC is 
shown in Fig. [7] 



4.2 Geometry of an iron-dominated multipole magnet 

Normal-conducting magnets usually use iron cores to increase the flux density achieved by a given 
current. In such a magnet, the shape of the magnetic field depends mainly on the geometry of the 
iron. In this section, we shall derive the geometry required to generate a pure multipole of given order. 
To simplify the problem, we will make some approximations: in particular, we shall assume that the iron 
core has uniform cross-section and infinite extent along z; that there are no limits to the iron in x or y; 
and that the iron has infinite permeability. The field in a more realistic magnet will generally need to be 
calculated numerically; however, the characteristics derived from our idealized model are often a good 
starting point for the design of an iron-dominated multipole magnet. 

We base our analysis on the magnetic scalar potential, (f, which is related to the magnetic field B 

by: 

B = -gradvj. (30) 

Note that the curl of the field in this case is zero, for any function ip: this is a consequence of the 
mathematical properties of the grad and curl operators. Therefore, it follows from Maxwell's equation ^ 
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Fig. 6: Current distributions for generating pure multipole fields. Top: dipole. Middle: quadrupole. Bottom: 
sextupole. Fields on the left are normal (an positive); those on the right are skew (bn positive). The positive 
y axis is vertically up; the positive x axis is horizontal and to the right. The deviation of the red line from the 
circular boundary shows the local current density. Current is flowing in the positive z direction (out of the page) 
for increased radius, and in the negative z direction for reduced radius. 
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Fig. 7: Superconducting quadrupoles in the LHC. 



that a magnetic field can only be derived from a scalar potential if: (i) there is no current density at the 
location where the field is to be calculated; (ii) there is no time-dependent electric displacement at the 
location where the field is to be calculated. Where there exists an electric current or a time-dependent 
electric field, it is more appropriate to use a vector potential (in which case, the magnetic flux density is 
found from the curl of the vector potential). However, for multipole fields, we have already shown that 
both the divergence and the curl of the field vanish, Eqs. ( [2T] ) and (|22]). Since the curl of the grad of any 



function is identically zero, Eq. ( |22| ) is automatically satisfied for any field B derived using ( |30| ). From 
Eq. ( [2T] ), we find: 

VV = 0, (31) 



where is the laplacian operator. Eq. (31_) is Poisson's equation: the scalar potential in a particular 
case is found by solving this equation with given boundary conditions. 

To determine the geometry of iron required to generate a pure multipole field, we shall start by 
writing down the scalar potential for a pure multipole field. Since the magnetic flux density B is obtained 
from the gradient of the scalar potential, the flux density at any point must be perpendicular to a surface 
of constant scalar potential. However, we already know, from Eq. ( [T4| ), that the magnetic flux density 
at the surface of a material with infinite permeability must be perpendicular to that surface. Hence, to 
generate a pure multipole field in a magnet containing material of infinite permeability, we just need 
to shape the material so that its surface follows a surface of constant magnetic scalar potential for the 
required field. 

We therefore look for a potential (f that satisfies: 
f d d\ 

As we shall now show, an appropriate solution is: 

^ = -\Cn\—sm{n9-^n) (32) 
n 

where: 

X ^ iy — re^^, 



n-l 
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and so: 



X — rcos^, 
y — rsiiiO. 



That Eq. (32) is indeed the potential for a pure multipole of order n can be shown as follows. In polar 



coordinates, the gradient can be written: 

duD 9 Old 

grad(/; = r-^ + --J, (33) 
or r oO 

where r and 9 are unit vectors in the directions of increasing r and 9, respectively. Using: 

f = xcos9 ^ ysm9, 
9 = —xsin9 + ycos9, 



it follows from Eq. (33 ) that: 



grade/:? = (x cos 9 -\- y sin 9) \Cn\r'^ ^ sin{n9 — cpn) — {xsin9 — y cos 9) \Cn\r'^ ^ cos(n9 
= xsin ((n - 1)^ - M \Cn\ r^"^ + y cos ((n - 1)9 - ^ \Cn\ r^"^ 



Thus, the field derived from the potential ([32]) can be written: 

By + iB^ = \Cn\ e-^^-r^-ie^(^-i)^. 

Therefore, if: 
then: 

By + iS, = ar"-ie^(--i)^ = a(x + iyf-^, 

and we see that the potential ([32]) does indeed generate a pure multipole field of order n. 

From the above argument, we can immediately conclude that to generate a pure multipole field, 
we can shape a high permeability material such that the surface of the material follows the curve given 
(in parametric form, with parameter 9) by: 

r"sin(n0-0,) =rj, (34) 

where ro is a constant giving the minimum distance between the surface of the material and the origin. 
The cross-sections of iron-dominated multipole magnets of orders 1, 2 and 3 are shown in Fig.[8j Note 
that r ^ OO forn^ — 0^ ^ integer x tt. Treating each region between infinite values of r as a separate 
pole, we see that a pure multipole of order n has In poles. We also see that the potential changes sign 
when moving from one pole to either adjacent pole: that is, poles alternate between "north" and "south". 
The field must be generated by currents flowing along wires between the poles, parallel to the z axis: to 
avoid direct contribution from the field around the wires, these wires should be located a large (in fact, 
infinite) distance from the origin. 

Note that it is possible to determine the shape of the pole face for a magnet containing any specified 
set of multipoles, by summing the potentials for the different multipole components, and then solving 
for r as a function of 9, for a fixed value of the scalar potential. Magnets designed to have more than 
one multipole component are often known as "combined function" magnets. Perhaps the most common 
type of combined function magnet is a dipole with a quadrupole component: such magnets can be used 
to steer and focus a beam simultaneously. The shape of the pole faces and the field lines in a dipole with 
(strong) quadrupole component is shown in Fig. [9] 
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Fig. 8: Pole shapes for generating pure multipole fields. Top: dipole. Middle: quadrupole. Bottom: sextupole. 
Fields on the left are normal (a^ positive); those on the right are skew (bn positive). The positive y axis is 
vertically up; the positive x axis is horizontal and to the right. The poles, shown as black (north) or grey (south), 
are constructed from material with infinite permeability. 
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Fig. 9: Pole shapes for dipole magnet with additional quadmpole component. 





T 



Fig. 10: Iron-dominated quadrupole magnet for the EMMA Fixed-Field Alternating Gradient accelerator at Dares- 
bury Laboratory. Left: magnet cross-section ^8J . Right: magnet prototype ^9J . 



In practice, some variation from the "ideal" geometry is needed, to account for the fact that the 
material used in the magnet has finite permeability, and finite extent transversely and longitudinally. The 
wires carrying the current that generates the magnetic flux are arranged in coils around each pole; as 
we shall see, the strength of the field is determined by the number of ampere-turns in each coil. An 
iron-dominated electromagnetic quadrupole is shown in Fig.[TOj 

To complete our discussion of methods to generate multipole fields, we derive an expression for 
the field strength in an iron dominated magnet with a given number of ampere-turns in the coil around 



each pole. To do this, we consider a line integral as shown in Fig.[TT] In the figure, we show a quadrupole; 
however the generalisation to other orders of multipole is straightforward. Note that, in principle, the 
coils carrying the electric current, and the line segment C3, are an infinite distance from the origin (the 
centre of the magnet). 

Using Maxwell's equation ([3]), with constant (zero) electric displacement, and integrating over the 
surface S bounded by the curve Ci -\- C2 ~^ gives: 

/ curlH-dS^ [ J-dS^-NI. 
Js Js 

Note that the surface is oriented so that the normal is parallel to the positive z axis; and the coil around 
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Fig. 11: Contour for line integral used to calculate the field strength in an iron-dominated quadrupole. 



each pole consists of N turns of wire carrying current /. Applying Stokes' theorem ([7]) gives: 

/ H'df+ f H ' dt+ j H • dl^ -NI. 

J C^ J Co J 



We know, from Eq. \\2\ , that the normal component of the magnetic flux density B is continuous across 
a boundary. Then, since B = jiH, it follows that for a finite field between the poles, and for /i ^ oc, the 
magnetic intensity H vanishes within the poles. Also, the field is perpendicular to the line segment C2. 
Thus, the only part of the integral that makes a non-zero contribution, is the integral along Ci from the 
face of the pole to the origin. Hence: 



Jo 



Br{r) 



dr = NI. 



(35) 



/o Mo 

The contour Ci is chosen so that along this contour, the field has only a radial component, parallel to the 



contour. From Eq. (24), we see that for a multipole of order n, along this contour we have: 

Bj^ = I Cfi ^ . 
Thus, we find by performing the integral in Eq. ( [35] ): 



n 



\Cn\= fioNI--. 



For a normal multipole, the field is given by: 



By + iBx 



fiQuNI f X + iy 



n-l 



For example, in a normal quadrupole (n = 2), the field gradient is given by: 

dBy _ 2/io7V/ 



(36) 
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5 Multipole decomposition 

In the previous section, we derived the current density distributions and material geometries needed to 
generate a pure multipole field of a given order. However, the distributions and geometries required are 
not perfectly achievable in practice: the currents and materials have infinite longitudinal extent; and we 
require either a current that exists purely on the surface of a cylinder, or infinite permeability materials 
with infinite transverse extent. 

Real multipole magnets, therefore, will not consist of a single multipole component, but a super- 
position of (in general) an infinite number of multipole fields. The exact shape of the field can have a sig- 
nificant impact on the beam dynamics in an accelerator. In many simulation codes for accelerator beam 
dynamics, the magnets are specified by the multipole coefficients: this is because simple techniques exist 
for approximating the effect, for example, of sextupole, octopule, and other higher-order components in 
the field of a quadrupole magnet. The question then arises how to determine the multipole components 
in a given magnetic field. 

At this point, we can make a distinction between the design field of a magnet, and the field that 
exists within a fabricated magnet. The design field is one that is still in some sense "ideal"; though the 
design field for a quadrupole magnet (for example) will contain other multipole components, because 
the design has to respect practical constraints, i.e. the magnet will have finite longitudinal and trans- 
verse extent, any currents will flow in wires of non-zero dimension, and any materials present will have 
finite (and often non-linear) permeability. Usually, one attempts to optimize the design to minimize the 
strengths of the multipole components apart from the one required: the residual strengths are generally 
known as systematic multipole errors. These errors will be present in any fabricated magnet, although, 
because of construction tolerances, the errors will vary between any two magnets of the same nominal 
design. The differences between the multipole components in the design field and the components in a 
particular magnet are known as random multipole errors. 

The effects of systematic and random multipole errors on an accelerator, and hence the specifi- 
cation of upper limits on these quantities, can usually only be properly understood by running beam 
dynamics simulations. Therefore, accelerator magnet (and lattice) design often proceeds iteratively. 
Some initial estimate of the limits on the errors is often needed to guide the magnet design; but then any 
design that is developed must be studied by further beam dynamics simulations to determine whether 
improvements are needed. 

It is therefore important to be able to determine the multipole components in a magnetic field from 
numerical field data: these data may come from either a magnetic model (i.e. from the design of a mag- 
net), or from measurements on a real device. There are different procedures that can be used to achieve 
the "decomposition" of a field into its multipole components. In this section, we shall consider methods 
based on Cartesian and polar representations of two-dimensional fields (i.e. fields that are independent of 
the longitudinal coordinate). In Section [6] we shall consider decompositions of three-dimensional fields 
(i.e. fields that have explicit dependence on longitudinal as well as transverse coordinates). However, we 
first consider an important concept in the discussion of multipole field errors, namely how the symmetry 
of a multipole magnet leads to "allowed" and "forbidden" higher-order multipoles. 

5.1 Multipole symmetry, "allowed" and "forbidden" higher-order multipoles 

A pure multipole field of order n can be written: 

By + zS^ = \Cn\ e-^^-r^-ie^(^-i)^. (37) 

The parameter 0^ characterises the angular orientation of the magnet around the z axis. In particular, 
from Eq. ([34]), we see that a change in (j)n by na is equivalent to a rotation of the coordinates (a change 
in 9) by —a. Thus, a rotation of a magnet around the z axis by angle a may be represented by a change 



in (j)n by na. In particular, if the magnet is rotated by vr/n, then from Eq. (37), we see that the field at 
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any point simply changes sign: 



if ^ + TT, then B ^ -B. (38) 

This property of the magnetic field is imposed by the synmietry of the magnet. In a real magnet, it will 
not be satisfied exactly, because random variations in the geometry will break the symmetry. However, it 
is possible to maintain the synmietry exactly in the design of the magnet; this means that although higher 



order multipoles will in general be present, only those multipoles satisfying the symmetry constraint ( |38| ) 
can be present. These are the "allowed" multipoles. Other multipoles, which must be completely absent, 
are the "forbidden" multipoles. 

We can derive a simple expression for the allowed multipoles in a magnet designed with symmetry 
for a multipole of order n. Consider an additional multipole (a "systematic error") in this field, of order 
m. By the principle of superposition, the total field can be written as: 

By + iB^ = \Cn\ e-^^-r^-ie^(^-i)^ + \Cm\ e-^^-r^-ie^(^-i)^ 

The geometry is such that under a rotation about the z axis through vr/n, the magnet looks the same, 
except that all currents have reversed direction: therefore the field simply changes sign. Under this 
rotation 0^ 0^ + tt; however, 0m ^ + rrm/n. This means that we must have: 

— TT -1 
e — —1. 



Therefore, m/n must be an odd integer. Assuming that m ^ n (i.e. the multipole error is of a different 
order than the "main" multipole field), then: 

- = 3,5,7,... (39) 

n 

Thus, for a dipole, the allowed higher order multipoles are sextupole, decapole, etc.; for a quadrupole, 
the allowed higher order multipoles are dodecapole, 20-pole, etc. The fact that the allowed higher order 
multipoles have an order given by an odd integer multiplied by the order of the main multipole is a 
consequence of the fact that magnetic poles always occur in north-south pairs. This is illustrated for 
a quadrupole in Fig. [T2| here, we see that to maintain the correct rotational synmietry (with the field 
changing sign under a rotation through 7r/2) the first higher-order multipole must be constructed by 
"splitting" each main pole into three, then into five, and so on. 

The field in a real magnet will contain all higher order multipoles, not just the ones allowed 
by symmetry. However, it is often the case that the allowed multipoles dominate over the forbidden 
multipoles. 



5.2 Fitting multipoles: Cartesian basis 

Suppose we have obtained a set of numerical field data, either from a magnetic model, or from measure- 
ments on a real magnet. To determine the effect of the field on the beam dynamics in an accelerator, it is 
helpful to know the multipole components in the field. One way to compute the multipole components 
is to fit a polynomial to the field data. For example, if we consider a normal multipole (coefficients Cn 
are all real), the vertical field along the x axis (i.e. for y = 0) is given by: 

oo 

= (40) 

n=l 

The number of data points determines the highest order multipole that can be fitted. Fitting may be 
achieved using, for example, a routine that minimises the squares of the residuals between the data and 
the fitted function. However, although this procedure can, in principle, produce good results, it is not 
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Fig. 12: Normal quadmpole field (left) and dodecapole field (right). The dodecapole is the first higher-order 
multipole with the same rotational symmetry as the quadrupole (under a rotation by 7r/2, north and south poles 
interchange). 



very robust. In particular, the presence of multipoles of higher order than those included in the fit can 
affect the values determined for those multipoles that are included in the fit. We can illustrate this as 
follows. 

Let us construct a quadrupole field (n =2), and add to it higher order multipoles of order 3, 4, 5 
and 6. The values of the coefficients (actual values, and fitted values in two different cases) are given 
in Table [1} The field By/B^Qf is plotted as a function of x/i?ref in Fig. 13 the field data are shown as 
points, while the fit, including multipoles up to order 6, is shown as a line. Also shown is the deviation 
ABy/Bj-Qf from an ideal quadrupole field, i.e. ABy is the contribution of the higher order multipoles. 
We see that if we base the fit on all the multipoles that are present (i.e. up to order 6), then we obtain 
accurate values for all multipole coefficients. 

However, in general, multipoles of all orders are present, while our fit is based on a finite number 
of multipoles. If we try to fit the data in our illustrative case using multipoles up to order 5 only (i.e. 
omitting the order 6 multipole that is present), then we see that there is an impact on the accuracy with 
which we determine the lower-order multipoles. This can be seen in the final column of Table [TJ there 
is even an error in the value that we determine for the quadrupole strength. When we plot the fit against 
the field data, we see that there is some small residual deviation between the data and the fit: this is to 
be expected, since the function we are using to obtain the fit does not match exactly the function used to 
generate the data. Although not visible in the total field, plotted in Fig. [T4| the difference between the fit 
and the data is apparent in the plot of the deviation from the quadrupole field. 

Our concern is that the presence of higher-order multipoles has affected the accuracy with which 



Table 1: Actual and fitted multipole values for a quadrupole field with artificially constructed multipole errors. 

n actual coefficient a^n fitted coefficient a„ 



(n < 6) (n < 5) 



2 


1.000 


1.000 


0.9972 


3 


0.010 


0.010 


0.0100 


4 


0.001 


0.001 


0.0131 


5 


0.010 


0.010 


0.0100 


6 


0.010 


0.010 
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X/Rref X/Rref 

Fig. 13: Measured (points) and fitted (fine) field in a quadmpole with higher-order multipole errors of order 3, 4, 
5 and 6. Multipoles up to order 6 are fitted. Left: total field. Right: deviation from quadrupole field. 




X/Rref X/Rref 

Fig. 14: Measured (points) and fitted (line) field in a quadrupole with higher-order multipole errors of order 3, 4, 
5 and 6. Multipoles up to order 5 are fitted. Left: total field. Right: deviation from quadrupole field. 



we determine the lower-order multipoles, even down to the quadrupole field strength. This can have 
significant implications for beam dynamics: the effect of a linear focusing error in a beam line (from 
some variation in the quadrupole strength) can be very different from the effects of higher-order multipole 
errors. For example, if one is measuring the betatron tunes or the beta functions in a storage ring, these 
can be very sensitive to linear focusing errors, and relatively insensitive to higher order multipoles. 
Determining the multipole coefficients using a polynomial fit can lead to inaccurate predictions of the 
linear behaviour of the beam line, depending on the higher-order multipoles present in the magnets. 

The problem is that we have based our fit on monomials, i.e. powers of x. Our fit is a sum of these 
monomials, with coefficients determined from the data. However, it is possible to obtain a fit to data 
generated using one monomial, with a different monomial. For example, if one constructs data which is 
purely linear in x, then one can obtain a fit using a monomial (even though the fit will not be as good 
as one obtained using a monomial x). Mathematically, the basis functions we are using (monomials in 
this case) are not orthogonal: the coefficients we determine depend on the which set of basis functions 
we choose to use. A more robust technique would use basis functions that are orthogonal, i.e. the 
coefficients we determine will be the same, no matter which set of functions we choose. Fortunately, 
there exists an appropriate set of functions that provides an orthogonal basis for multipole fields. We 
discuss this basis in the following section, |5.3[ The advantage of orthogonal basis functions is that the 
coefficients we determine for different terms in the fit are independent of which terms we include in the 
fit; for example, the quadrupole strength that we find in a particular magnet will be the same, irrespective 
of which higher-order terms we include in the fit, and which higher-order terms are actually present. 
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5.3 Fitting multipoles: polar basis 

From Eq. ([24]) we know that the field in a multipole magnet can be written in polar coordinates as: 

oo 
n=l 

We see that if we make a set of measurements of and Bq at different values of 9 and fixed radial 
distance r, then we can obtain the coefficients Cn by a discrete Fourier transform. 

Suppose we make M measurements of the field, at 6> = 0^, where: 

Tfi 

9m = 27T — , m = 0,l,2...M-l. (41) 

We write the measurement at 9 = 9^ as S^; note that Bj^ is a complex number, whose real and 
imaginary parts are given by the azimuthal and radial components of the field at 6> = 6^^. 

Now we construct, for a chosen integer n': 

M-l M-1 oo 

m=0 m=0 n=l 

where tq is the radial distance at which the field measurements are made. The summation over m on the 
right hand side vanishes, unless n = n^ Thus, we can write: 

M-l 

If we relabel as n, then we see that the multipole coefficients Cn are given by: 

M-l 

= — ^ Vi?^e-2--M. (42) 



The advantage of this technique over that in section p^ is that the basis functions used to construct 
the fit are of the form e^^^, for integer n. These functions are orthogonal: mathematically, this means 
that: 



27T 



nn' 5 



where the Kronecker delta function 5nn' = 1 if n = n^ and 5nn' = if n 7^ n^ The important 
consequence for us, is that the value we determine for any given multipole using Eq. ( |42| ) is independent 
of the presence of any other multipoles, of higher or lower order. 

A further advantage of using the polar basis instead of the Cartesian basis, comes from the depen- 
dence of the field on the radial distance. Suppose that the field data are measured (or obtained from a 
model) with accuracy AS^. Then the accuracy in the multipole coefficients will be: 

AS. 



AC. 



^0 



We obtain better accuracy in the multipole coefficients if we choose the radius ro on which the measure- 
ments are made, to be as large as possible. Furthermore, the accuracy in the fitted field will be: 

/ \ n—l 

AB ^ ACni^j 

We obtain improved accuracy in the field for r < ro; but the accuracy reduces quickly (particularly 
for higher-order multipoles) for r > tq. It is important to choose the radial distance ro large enough 
to enclose all particles likely to pass through the magnet, otherwise results from tracking may not be 
accurate. 
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5.4 Multipole decomposition: some comments 

In this section, we have considered two techniques for deriving the multipole components of two- 
dimensional magnetostatic fields. We have seen that while the multipole components can be obtained, in 
principle, from a simple least-squares fit of a polynomial to the field components along one or other of 
the coordinate axes, there are advantages to basing the fit on field data obtained on a circle enclosing the 
origin, with as large a radius as possible. In the next section, we shall see how the idea of a multipole 
expansion can be generalised to three dimensions, and how a multipole decomposition can be performed 
in that case. However, it is worth pausing to consider in a little more detail some of the reasons for 
wishing to represent a field as a set of (multipole) modes. 

It is of course possible to represent a magnetic field using a set of numerical field data, giving the 
three field components on points forming a "mesh" covering the region of interest. In some ways, this 
is a very convenient representation, since it is the one usually provided directly by a magnetic modelling 
code: further processing is usually required to arrive at other representations. However, while a numerical 
field map in two dimensions is often a practical representation, in three dimensions the amount of data in 
even a relatively simple magnet can become extremely large, especially if a high resolution is required for 
the mesh. A multipole representation, on the other hand, provides the description of a magnetic field as 
a relatively small set of coefficients, from which the field components at any point can be reconstructed, 
using the basis functions. In other words, a multipole representation is more "portable" than a numerical 
field map. 

Secondly, a representation based on a multipole expansion lends itself to further manipulation in 
ways that a numerical field map does not. For example, any noise in the data (from measurement or 
computational errors) can be "smoothed" by suppression of higher-order modes. Conversely, random 
errors can be introduced into data based on a model with perfect symmetry by introducing multipole 
coefficients corresponding to "forbidden" harmonics. There will of course be issues surrounding the 
suppression or enhancement of errors by adjusting the multipole coefficients; however, one benefit of 
this approach is that for any set of multipole coefficients, the field is at least a physical field, in the sense 
of satisfying Maxwell's equations. The same will, for example, not usually be true if a general smoothing 
algorithm is applied to a numerical field map. 

Finally, one of the main motivations for performing a multipole decomposition of a field is to 
provide data in a format appropriate for many beam dynamics codes. Accuracy is one criterion often 
important for beam dynamics codes: efficiency is another. Characterisation of a storage ring frequently 
requires tracking of thousands of particles over hundreds or thousands of turns, through a beam line that 
can easily consist of hundreds of magnetic elements. Numerical integration of the equations of motion 
for a particle in a numerical field map is generally too slow to be a practical method. There are many 
techniques that can be used to improve the efficiency of particle tracking in accelerators: one of the most 
common is the "thin lens" method. The dynamical effects of dipoles and quadrupoles usually need to 
be represented with high accuracy. Fortunately, for these magnets, it is possible to write down accurate 
solutions to the equations of motion in closed form, allowing tracking through a magnet of given length 
to be performed in a single step. The same is not true for sextupoles, or higher-order multipoles; however, 
it is usually sufficiently accurate to represent such magnets by a model in which the length of the magnet 
approaches zero, but where the integrated strength (the multipole coefficient multiplied by the length) 
remains constant. For such a "thin lens" it is possible to write down exact solutions to the equations 
of motion, allowing tracking again to be performed in a single step. A quadrupole with higher-order 
multipole errors can be represented as a "long" perfect quadrupole field, with a set of "thin" multipoles 
at one end, or at the centre. However, construction of such a model for a tracking code requires a 
multipole decomposition of the field obtained from a magnet modelling code. 



We should emphasise that in our discussion of multipole decomposition, here and in sections 5.2 



and |5.3[ we have made no clear distinction between field data obtained from a computational model, 
or from measurement of a real magnet. Of course, it is much easier to obtain the data required from a 
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computational model: it is then quite straightforward to perform the required decomposition to determine 
the values of the various multipole coefficients. Unfortunately, the data do not include manufacturing 
errors, which can be very important. Measurements provide more realistic data: however, many other 
issues need to be addressed, including accuracy of field measurements, alignment of the measurement 
instruments with respect to the magnet, etc. Such issues are beyond the scope of our discussion. 

6 Three-dimensional fields 

In the previous sections, we have restricted ourselves to the case where the magnetic field is independent 
of the longitudinal coordinate. The multipole modes that we can use for such fields actually provide a 
good description for many accelerator multipole magnets, even though such magnets of course have finite 
length. The ends or "fringe fields" of dipoles, quadrupoles and so on, where the field strengths often vary 
rapidly with longitudinal position, cannot be accurately represented by two-dimensional fields; however, 
in many accelerators, only the fringe fields of dipoles have a significant impact on the dynamics. 

However, there are cases where a full three-dimensional description of a magnetic field is desir- 
able, or even necessary. For example, the fields of insertion devices (wigglers and undulators) are often 
represented as a sequence of short dipoles of alternating polarity; however, where the period becomes 
small compared with the aperture, the three-dimensional nature of the field can start to have effects that 
cannot be ignored. There can even be cases where "conventional" multipoles designed for special situa- 
tions (for example, where very wide aperture is required, and where the length of the magnet needs to be 
short, because of space constraints) can have fringe fields that affect the dynamics to a significant extent. 

It is therefore of somewhat more than purely academic interest to consider how two-dimensional 
multipole representations may be generalised to three dimensions. As usual, there are many different 
ways to approach the problem: the method that is used will often depend on the problem to be solved. 
In the following sections, we describe two rather general methods that may be of use in many situations 
arising in accelerators. First, we consider a field expansion based on Cartesian modes. While this 
provides some nice illustrations, the Cartesian expansion does have some disadvantages. To address 
these disadvantages, we describe how a field expansion based on polar coordinates can be performed. 

6.1 Cartesian modes 

Consider the field given by: 

Bx = — sin kxxsmh kyU sin kzZ, (43) 

ky 

By — Bq cos kxX cosh kyy sin kzZ^ (44) 
k 

Bz = B{)-^ coskxXsinYikyH coskzZ. (45) 

ky 

As may easily be verified, this field satisfies: 

curlS = 0. 

Furthermore, the equation: 

divS = 

is satisfied if: 

kl^kl + kl. (46) 

We conclude that, as long as the constraint ( |46| ) is satisfied, that the fields (|43])-(|45]) provide solutions to 
Maxwell's equations in regions with constant permeability, and static (or zero) electric fields. Of course, 
it is possible to find similar sets of equations but with different "phase" along each of the coordinate axes, 
and with the hyperbolic trigonometric function appearing for the dependence on x or z, rather than y. 
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By superposing fields, with appropriate variations on the form given by Eqs. ([43|)-([45|), it is possible to 
construct quite general three-dimensional magnetic fields. For example, a slightly more general field than 



that given by Eqs. ([43])-([45)) can be obtained simply by superposing fields with different mode numbers 
and amplitudes: 



k 

B{kx, kz)j^ sin kxxsmh kyy sin kzZdkx dkz^ 



= -// 

By — J J B{kx,kz) COS kxX cosh kyy sin kzZ dkx dkz, 



B. 



k 

B{kx, kz)j^ cos fca^xsinh kyy cos kzzdkx dkz 



(47) 
(48) 
(49) 



In this form, we see already how to perform a mode decomposition, i.e. how we can determine the 
coefficients B{kx^ kz) as functions of the "mode numbers" kx and kz. If we consider in particular the 



vertical field component on the plane y = yo, then we have from ( [48) ) 

B.j 



cosh kyyo 



kz) cos krX sin kzZ dkr dkz- 



Hence, B{kx, kz) may be obtained from an inverse Fourier transform of By{x^ z)/cosh kyyo. Given field 
data on a grid over x and z, then we can perform numerically an inverse discrete Fourier transform, to 
obtain a set of coefficients B{kx,kz). Note that once we have obtained these coefficients, then we can 
reconstruct all field components at all points in space. This is an important consequence of the strong 
constraints on the fields provided by Maxwell's equations: in general, for a static field, if we know 
how one field component varies over a two-dimensional plane, then we can deduce how all the field 
components vary over all space (on and off the plane). 

Let us consider an example. To keep things simple, we shall again work with the case where the 
field is independent of one coordinate: now, however, we shall assume that the fields are independent 
of the horizontal transverse, rather than the longitudinal coordinate. This may be a suitable model for 
a planar wiggler or undulator with very wide poles. The model may of course be extended to include 
dependence of the fields on the horizontal transverse coordinate: although our immediate example strictly 
deals with a two-dimensional field, the extension to three dimensions is quite straightforward. 

Suppose that the mode amplitude function B{kx^kz) has the form: 

B(kx,kz) = 6{kx)B{kz), (50) 
where 6{kx) is the Dirac delta function. The delta function has the property that, for any function f{kx): 

S{kx)f{kx)dkx = f{0). 



Using (50) in Eqs. (HTb to (49) gives: 



Bcr 



B., 



B, 



0, 



B{kz) cosh kzy sin kzZ dkz, 



There is no horizontal transverse field component, and the vertical and longitudinal field components 
have no dependence on x: we have a two-dimensional field. In a plane defined by a particular value for 
the vertical coordinate, y = yo, the vertical field component is given by: 



By{z) = cosh kzy / B{kz) sin kzzdk^ 



I 
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The mode amplitude function can be obtained from a Fourier transform of the vertical component of the 
magnetic field on the plane y = yo. Usually, we will have a finite set of field data, obtained from a 
magnet modelling code, or from measurements on a real device. 

Suppose that we have a data set of 2M + 1 vertical field measurements, taken at locations: 

m 

y^yo, ^^M^' ^^^^ 

where m is an integer in the range —M<m< M. The field at any point is given by: 

M 



By{y^z)— Bjn cosh mkzy sin. mkzZ, 



m=-M 

where: ^ 

ky ^ ~~~r • 

Note that in this case, the field is antisymmetric about z = 0, i.e. 

By{y, -z) = -By{y,z). 
The mode amplitudes are obtained by: 

1 1 ^ 

m'=-M 

Note that, because of the antisymmetry of the field: 

As a specific numerical example, let us construct an "artificial" data set along a line y = yo = 0.25, 
and with i = 3. The data are constructed using a function that gives a sinusoidal variation in the field 
along z up to l^l < 1.25; then a continuous and smooth (continuous first derivative) fall-off to zero field 
for 1^1 > 1.5. For this numerical example, we do not worry unduly about units: the reader may assume 
lengths in cm, fields in kG, or any other preferred units. Initially, we take M = 40, i.e. we assume we 
have 81 measurements of the field (or, we have computed the field from a model at 81 equally- spaced 
points along z; strictly speaking, because we are dealing with the case that the field is antisymmetric in z, 
we need only half this number of field measurements or computations). The field "data", the fitted field 



(reconstructed using the mode amplitudes) in y = 0.25, and the mode amplitudes, are shown in Fig.[T5 

It is interesting to compare with the situation where we have only 31 field measurements or com- 
putations, i.e. M = 15. Using the same function that we used to construct the data set with 81 data 



points, we produce the fit and the mode amplitudes shown in Fig. [T6j Comparing Figs. [T5] and [T6j we 
see that in both cases, the fitted field does pass exactly through all the data points. This is a necessary 
consequence of the fit, which is based on a discrete Fourier transform of the data points. However, using 
only 3 1 data points, there is a significant oscillation of the fitted field between the data points in the region 
1.5 < l^l < 3, where the field is actually zero (by construction). This is a consequence of the fact that 
we have "truncated" some modes with non-negligible amplitude. The mode amplitudes in both cases are 
the same for mode numbers — 15 < m < m; but with 81 data points we can determine amplitudes for a 
larger number of modes, which gives us a more accurate interpolation between the data points. 

Having obtained fits to the field in the plane y = 0.25, we can reconstruct the field at any point, 
on or off the plane. It is often of interest to look at the mid-plane; usually, this is defined by y = 0. In 
this plane, we do not have any field data. However, we can compare the field produced by the fits with 



81 data points and with 31 data points: these fields are shown in Fig. 17 We see that the fit based on 
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Fig. 15: Left: Field data (points) and fit (fine) in a magnet with dependence of the field on longitudinal coordinate 
z. Right: Mode amplitudes. The field data consist of 81 measurements (or computations) at equally-spaced points 
from z = —3to z = +3, and y = 0.25. 
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Fig. 16: Left: Field data (points) and fit (line) in a magnet with dependence of the field on longitudinal coordinate 
z. Right: Mode amplitudes. The field data consist of 31 measurements (or computations) at equally-spaced points 
from z = —3to z = +3, and y = 0.25. 




31 data points produces an essentially identical field on = as the fit based on 81 data points. (The 
data points in each case are taken on the plane y = 0.25). This is a consequence of the "suppression" of 
higher-order modes, that arises from the hyperbolic dependence of the field on the y coordinate. 

To emphasise the significance of the hyperbolic dependence of the field on the vertical coordinate, 
we can look at the variation of By with y, for a given value of z. We choose z = 0.25, which corresponds 
to a peak in the vertical field component as a function of z. The variation of By with y for the two cases 



(fit based on 81 data points, and fit based on 31 data points) is shown in Fig. 18 
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Fig. 18: Vertical field component as a function of y, for z = 0.25. The field is determined from fits to the data 



shown in Figs. 15 and 16 Left: fit determined from data set with 81 data points. Right: fit determined from data 
set with 31 data points. 
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Fig. 19: Vertical field component as a function of y, for z = 0.25. The field is determined from fits to the data 



shown in Figs. [T5] and [16] Left: fit determined from data set with 81 data points. Right: fit determined from data 
set with 31 data points. 



Up to ^ = 0.25, the two fits give essentially the same field. However, if we try to extrapolate 
beyond this plane (the plane on which the fit was performed), we see dramatically different behaviour. 
Fig. [19] compares the vertical field component obtained from the two fits (81 data points and 31 data 
points), again at z = 0.25, but now with a range of y from to 0.5. In one case (81 data points), the field 
increases to a maximum before dropping rapidly. In the other case (31 data points), the field increases 
monotonically over the range. The reason for the different behaviour is the additional modes in the fit 
to the set of 81 data points. These higher order modes make only a small additional contribution to the 
field for \y\ < 0.25; but for values of the vertical coordinate beyond this value, because of the hyper- 
bolic dependence of y, the contribution of these modes becomes increasingly significant, and eventually, 
dominant. 

The behaviour of the field fits for |^| > 0.25 is a clear illustration of why it is dangerous to 
extrapolate the fit beyond the region enclosed by the plane of the fit. In this case, because of the symmetry 
in the vertical direction, the region enclosed is between the planes y = —0.25 and y = +0.25. The "safe" 
region is also bounded in z, by z = —0.3 and z = +0.3; because we use discrete mode numbers in z, 
the fitted fields will in fact be periodic in z, and will repeat with period z = 0.6. In general, there will be 
similar periodicity in x\ however, in this particular example, we analysed a field that was independent of 
X, so the "safe" region of the fit is unbounded in x. 
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6.2 Cylindrical modes 



The Caretsian modes discussed in Section [6^1] are often useful for describing fields in insertion devices, 
particularly those that have weak variation of the field with x, and periodic behaviour in z (over some 
range): because the modes "reflect" the geometry, it is often possible to achieve good fits to a given 
field using a small number of modes. To maximise the region over which the fit is reliable, one needs to 
choose a plane with a value of y as large as possible, with x and z extending out as far as possible on 
this plane. For a planar undulator or wiggler, it is often possible to choose a plane close to the pole tips 
in which x in particular extends over the entire vacuum chamber. 

However, for other geometries, the Cartesian modes may not provide a convenient basis. For 
example, if the magnet has a circular aperture, then the plane that provides the largest range in x is the 
mid-plane, y = 0, and as y increases, the available range in x decreases. To base the fit on the Cartesian 
basis requires some compromise between the range of reliability in the horizontal transverse and vertical 
directions. 

Fortunately, it is possible to choose an alternative basis for magnets with circular aperture, in 
which the field fit can be based on the surface of a cylinder inscribed through the magnet. In that case, 
the radius of the cylinder can be close to the aperture limit, maximising the range of reliability of the fit. 
The appropriate modes in this case are most easily expressed in cylindrical polar coordinates. 

A field with zero divergence and curl (and hence satisfying Maxwell's equations for static fields 
in regions with uniform permeability) is given by: 

Br ^ j dk^^Bnik^) r^{k^r) sinnO cos k^z, (52) 
Bo = dkzy^ Bn{kz) T^In{kzr) cos n0 cos kzZ, (53) 
Bz = - dkz^Bn{kz) In{kzr)smn9 sin kzZ. (54) 

n 

Here, In{kzr) is the modified Bessel function of the first kind, of order n. Modified Bessel functions of 



the first kind for order n = to n = 3 are plotted in Fig. [20j For small values of the argument ^, the 
modified Bessel function of order n has the series expansion: 

For larger values of the argument, the modified Bessel functions /^(O increase exponentially. This is 
significant: it means that if we fit a field to data on the surface of a cylinder of given radius, then residuals 
of the fit will decrease exponentially within the cylinder towards r = 0, and increase exponentially 
outside the cylinder with increasing r. The "safe" region of the fit will be within the cylinder. 



Note that Eqs. ([52|)-([54|) may be generalised to include different "phases" in the azimuthal angle 
9 and the longitudinal coordinate z. 

An attractive feature of the polar basis is that it is possible to draw a direct connection between 
the three-dimensional modes in this basis and the multipole components in a two-dimensional field. 
Consider a mode amplitude Bn{kz) given (for some particular value of n) by: 

Bn{k,) = 2"ri(l + n)C„4M' (56) 

where 5() is the Dirac delta function, and Cn is a constant. Substituting these mode amphtudes into 
Eqs. (|52|)-(|54l>, using the expansion ( [55] ), and performing the integral over kz gives: 



Br = ^Cnr"-^sinn^, 
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Fig. 20: Modified Bessel functions of the first kind, of order n = to n = 3. 



= 0. 

Comparing with Eq. ([24]), we see that this is a multipole field of order n. Thus, a two-dimensional 



multipole field is a special case of a three-dimensional field (p2|)-(|54|), with mode coefficient given by 
Eq. ([56]). 

In general, the mode coefficients Bn{kz) may be obtained by a Fourier transform of the field on 



the surface of a cylinder of given radius. For example, it follows from Eq. ( [52] ) that: 



An elegant feature of the polar basis, as compared to the Cartesian basis discussed in Section [6?T| is that 
the modes reflect the real periodicity of the field in the angle coordinate 9. In the Cartesian basis, the 
modes were periodic in x, although the field, in general, would not have any periodicity in x. 

Since the mode coefficients Bn(kz) are related to the multipole coefficients in a two-dimensional 
field, we can use these coefficients to extend the idea of a multipole to a three-dimensional field. Strictly 
speaking, the mode coefficients Bn{kz) are related to the field by a two-dimensional Fourier transform; 
however, we can perform a one-dimensional inverse Fourier transform (in the z variable) to obtain a set 
of functions which represent, in some sense, the "multipole components" of a three-dimensional field as 
a function of z. Here, we use the term "multipole components" rather loosely, since a multipole field is 
strictly defined only in the two-dimensional case (i.e. for a field that is independent of the longitudinal 
coordinate). A quantity that is perhaps easier to interpret is the contribution to the field at any point made 
by the mode coefficients Bn{kz) with a given n. For n = 1, the field components Bj. and Bq at any point 
in z will behave as for a dipole field; forn = 2, and Bq will behave as for a quadrupole field, and so 
on. 

As an illustrative example, we consider the field in a specific device: the wiggler in a damping 
ring for TESLA (a proposed linear collider) |10|. This wiggler has a peak field of 1.6 T and period 
400 mm; the total length of wiggler in each of the TESLA damping rings would be over 400 m. The field 
in the wiggler has been studied extensively, because of concerns that dynamical effects associated with 
the nonlinear components in the field would limit the acceptance of the damping ring [ 1 1 1. A model was 
constructed for one quarter period of the magnet, which allowed the field at any point within the body of 
the magnet to be computed. Effects associated with the ends of the wiggler were neglected, but could in 
principle be included in the study. By performing a mode decomposition using the techniques described 
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Fig. 21: Fit to the field of one quarter of one period of the TESLA damping ring wiggler. 



above, it was possible to construct an accurate dynamical model allowing fast tracking to characterise 
the acceptance of the damping ring. The methods used for the dynamical analysis are beyond the scope 
of the present discussion; however, we present the results of the analysis relating directly to the field, to 
illustrate the methods described in this section. 

A model of the wiggler was used to compute the magnetic field on a mesh of points bounded by 
a cylinder of radius 9 mm, within one quarter period of the wiggler. Although all field components were 
computed on the mesh, which covered the interior of the cylinder as well as the surface, only the radial 
field component on the surface of the cylinder was used to calculate the mode amplitudes. The fit can be 
validated by comparing the field "predicted" by the fit with the field data (from the computational model) 
not used directly in the fitting procedure. A fit achieved using 7 azimuthal and 100 longitudinal modes 
is shown in Fig. [21] Each plot shows the variation of the vertical field as a function of one Cartesian 
coordinate, with the other two coordinates fixed at zero. In the vertical direction, the range shown is 
from the mid-plane of the wiggler to close to the pole tip. Note that the variation in the field in the 
transverse (x and y) directions is very small, less than 0.1% of the maximum field. It appears from 
Fig. [21] that there is very good agreement between the fit (line) and the field data (circles) within the 
cylinder on the surface of which the fit was performed. 

The quality of the fit can be further illustrated by plotting the residuals, i.e. the difference between 
the fitted field and the field data. The residuals for the vertical field component on two horizontal planes, 
y = mm and y = 6 mm are shown in Fig. [22j Note that to produced "smooth" surface plots, we 
interpolate between the mesh points used in the computational model. On the mid-plane of the wiggler. 
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Fig. 22: Residuals of the fit to the field of one quarter of one period of the TESLA damping ring wiggler. 



the residuals are less than 1 gauss (recall that the peak field is 1.6 T); the region shown in the left hand 
plot in Fig. [22] lies entirely within the surface of the cylinder used in the fit. On the plane y = 6 mm, 
the residuals are somewhat larger, and show an exponential increase for large values of the horizontal 
transverse coordinate: but note that for values of x larger than about 6.7 mm, the points in the plot are 
outside the surface of the cylinder used for the fit. In the longitudinal direction, the residuals appear 
to be dominated by very high frequency modes: this suggests that it may be possible to reduce the 
residuals still further by increasing the number of longitudinal modes used in the fit. However, this fit 
was considered to be of sufficient quality to allow an accurate determination of the effect of the wiggler 
on the beam dynamics to be made. 

The tools used for study of the beam dynamics were based on the mode coefficients determined 
by the fitting procedure. Once a fit has been obtained and shown to be of good quality, then, strictly 
speaking, further analysis of the field is not required. However, it is interesting to compute, from the 
mode amplitudes, the contribution to the field in the wiggler from different "multipole" components, as 
a function of longitudinal position. As described above, the contribution of a multipole of order n is 
obtained by a one-dimensional (in the longitudinal dimension) inverse Fourier transform of the mode 
amplitudes Bn{kz). To obtain non-zero values for the contributions from multipoles higher than order 
n = \ (dipole), we need to choose non-zero values for either the xory coordinates at which we compute 
the field. We choose (arbitrarily) x = 8 mm, and = mni- The contributions to the vertical field 
component from multipoles of order 1 through 7 are shown in Fig. [23] Note that multipoles of even 
order are forbidden by the symmetry of the wiggler (see Section |5.1| ). We see from Fig. [23] that the 
dominant contribution by far is, as expected, the dipole component. The sextupole component is not 
insignificant; the contributions of higher order multipoles are extremely small, and the high-frequency 
"oscillation" as a function of longitudinal position is probably unphysical, and the result of noise in the 
fitting. 

It is worth making a few final remarks about mode decompositions for three-dimensional fields. 
First, as already mentioned, in many cases a full three-dimensional mode decomposition will not be 
necessary. While this does provide a detailed description of the field in a form suitable for beam dy- 
namics studies, three-dimensional decompositions do rely on a large number of accurate and detailed 
field measurements. While such "measurements" may be conveniently obtained from a model, it may 
be difficult or impractical to make such measurements on a real magnet. Fortunately, in many cases, 
a two-dimensional field description in terms of multipoles is sufficient. Generally, a three-dimensional 
analysis only need be undertaken where there are grounds to believe that the three-dimensional nature of 
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Fig. 23: Multipole contributions to the field in the TESLA damping wiggler as a function of longitudinal position, 
at X =8 mm and y =Omm, for orders 1 (dipole), 3 (sextupole), 5 (decapole) and 7. 



the field is likely to have a significant impact on the beam dynamics. 

Second, we have already emphasised that to obtain an accurate description of the field within some 
region in terms of a mode decomposition, the mode amplitudes should be determined by a fit on a surface 
enclosing the region of interest. Outside the region bounded by the surface of the fit, the fitted field can 
be expected to diverge exponentially from the real field. However, in choosing the surface for the fit, the 
geometry of the magnet will impose some constraints. A magnet with a wide rectangular aperture may 
lend itself to a description using a Cartesian basis (fitting on the surface of a rectangular box); a circular 
aperture, however, is more likely to require use of a polar basis (fitting on the surface of a cylinder with 
circular cross-section). Both cases have been described above. It may be appropriate in other cases to 
perform a fit on the surface of a cylinder with elliptical cross-section. The basis functions in this case 
involve Mathieu functions. For further details, the reader is referred to work by Dragt 1 12 1 and by Dragt 
and Mitchell tl3J. 



Appendices 

A The vector potential 

Our analysis of iron-dominated multipole magnets in Section |4.2| was based on the magnetic scalar 
potential, Lp. The magnetic flux density can be derived from a scalar potential: 

B = — grad(/9 

in the case that the flux density has vanishing divergence and curl: 

divS = curl 5 = 0. 

More generally (in particular, where the flux density has non- vanishing curl) one derives the magnetic 
flux density from a vector potential A, using: 

S = curlA (A.l) 
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Although we have not required the vector potential in our discussion of Maxwell's equations for accel- 
erator magnets, it is sometimes used in analysis of beam dynamics. In particular, descriptions of the 
dynamics based on Hamiltonian mechanics generally use the vector potential rather than the magnetic 
flux density or the magnetic scalar potential. We therefore include here a brief discussion of the vector 
potential, paying attention to aspects relevant to the descriptions we have developed for two-dimensional 
and three-dimensional magnet fields. 

First, we note that the divergence of any curl is identically zero: 

div curl y = 0, 

for any differentiable vector field V. Thus, if we write B = curl A, then Maxwell's equation 

divS = 0, 

is automatically satisfied. Maxwell's equation ([3]) in uniform media (constant permeability), with zero 
current and static electric fields gives: 

curlS = /xJ, (A.2) 
where J is the current density. This leads to the equation for the vector potential: 

curl curl A = grad (div A) - V'^A fiJ. (A.3) 



Eq. ( |A.3| ) is a second-order differential equation for the vector potential in a medium with permeability 



/i, where the current density is J. This appears harder to solve than the first-order differential equation 
for the magnetic flux density, Eq. ( |A.2| ). However, Eq. ( |A.3| ) may be simplified significantly, if we apply 



an appropriate gauge condition. To understand what this means, recall that the magnetic flux density is 
given by the curl of the vector potential, and that the curl of the gradient of any scalar field is identically 
zero. Thus, we can add the gradient of a scalar field to a vector potential, and obtain a new vector 
potential that gives the same flux density as the old one. That is, if: 

B = curl A, 

and: 

A' = A + grad^, (A.4) 
for an arbitrary differentiable scalar field ^, then: 

curl A' — curl A + curl grad — curl A — B. 

In other words, the vector potential A' leads to exactly the same flux density as the vector potential A. 
Since the dynamics of a given system are determined by t he fi elds rather than the potentials, either A' or 



A is a valid choice for the description of the system. Eq. \AA) is known as a gauge transformation. The 
consequence of having the freedom to make a gauge transformation means that the vector potential for 
any given system is not uniquely defined: given some particular vector potential, it is always possible to 
make a gauge tranformation without any change in the physical observables of a system. The analogue 
in the case of electric fields, of course, is that the "zero" of the electric scalar potential can be chosen ar- 
bitrarily: only changes in potential (i.e. energy) are observable, so given some particular scalar potential 
field, it is possible to add a constant (that is, a quantity independent of position) and obtain a new scalar 
potential that gives the same physical observables as the original scalar potential. 

For magnetostatic fields, we can use a gauge transformation to simplify Eq. ( |A.3| ). Suppose we 



have obtained a vector potential A for some particular physical system. Define a scalar field ^, which 
satisfies: 

V^^ = -div A (A.5) 
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Then define: 

A' = A-\- grad^. 

Since A' and A are related by a gauge transformation, they lead to the same magnetic flux density, and 
the same physical observables for the system. However, the divergence of A' vanishes: 

db/A' div A + divgrad^ -V^^ + V^^ 0, 

where we have used Eq. ( |A.5| ). Thus, given any vector potential, we can make a gauge transformation to 
find a new vector potential that gives the same magnetic flux density, but has vanishing divergence. The 
gauge condition: 

divl=0, (A.6) 

is known as the Coulomb gauge. It is possible to work with other gauge conditions (for example, for 
time-dependent electromagnetic fields the Lorenz gauge condition is often more appropriate); however, 
for our present purposes, the Coulomb gauge leads to a simplification of Eq. ( |A.3| ), which now becomes: 

V^l = J. (A.7) 

Eq. ( |A.7| ) is Poisson's equation for a vector field. Note that despite being a second-order differential 
equation, it is in a sense simpler than Maxwell's equation ([3]), since we have "decoupled" the components 
of the vectors; that is, we have a set of three uncoupled second-order differential equations, where each 
equation relates a component of the vector potential to the corresponding component of the current 
density. Eq. ( |A.7| ) has the solution: 



4:7T J \r - r'\ 



In this form, we see that the potential at a point in space is inversely proportional to the distance from 
the source. 

Now, consider the potential given by: 

A^ = 0, Ay = 0, A, = -Re^^i^^±i^. (A.8) 

n 



Taking derivatives, we find that: 

dx 
OA, 

dy 

Then, since A^ and Ay are zero, we have: 



-KtCn{x + iyT-^, 
\mCn{x + iyT-\ 



^ (dA, dA, 

B = curM = — — , — — ,0 

\ oy ox 

Hence: 

By + iB^^Cn{x + iyT-\ (A.9) 



which is just the multipole field. Thus, Eq. ( |A.8| ) is a potential that gives a multipole field. Note also that, 
since Az is independent of z, this potential satisfies the Coulomb gauge condition ( |A.6| ): 

,. J. dAy dA^ ^ 

div^ = — ^ + — ^ + — ^ = 0. 

ox oy oz 
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An advantage of working with the vector potential in the Coulomb gauge is that, for multipole 
fields, the transverse components of the vector potential are both zero. This simplifies, to some extent, 
the Hamiltonian equations of motion for a particle moving through a multipole field. However, note that 
the longitudinal component Bz of the magnetic flux density is zero in this case. To generate a solenoidal 
field, with Bz equal to a non-zero constant, we need to introduce non-zero components for Ax, or Ay, or 
both. For example, a solenoid field with flux density B^oi may be derived from the vector potential: 

A-x — ~~Bso\y^ Ay — —B^qix. 



Let us return for a moment to the case of multipole fields. If we work in a gauge in which the 
transverse components of the vector potential are both zero, then the field components are given by: 

B B,^^. 

^ dx ^ ^ dy ' 

From these expressions, we see that if we take any two points with the same y coordinate, then the 
difference in the vector potential between these two points is given by the "flux" passing through a line 
between these points: 

/\Az = - J By dx. 
Similarly for any two points with the same x coordinate: 



AAz - J Bx dy. 

In general, for a field that is independent of z, and working in a gauge where Ax ^ Ay ^ 0, we can 
write: 

AA, = — , (A. 10) 

where AA^ is the change in the vector potential between two points Pi and P2 in a given plane z ^ zq\ 



and A$ is the magnetic flux through a rectangular "loop" with vertices Pi, P2, P3 and P4: see Fig. A.l 



P3 and P4 are points obtained by transporting Pi and P2 a distance Az parallel to the z axis. Eq. ( |A.10| ) 
can also be obtained by applying Stokes' theorem to the loop P1P2P3P4, with the relationship ( |A.1| ) 
between B and A: 

J A-dl^ J curl A -dS^ J B-dS, 

hence: 

Az{P2)Az-Az{Pi)Az = A^. 

Finally, we give the vector potentials corresponding to three-dimensional fields. In the Cartesian 
basis, with the field given by Eqs. (|43])-(|45]), a possible vector potential (in the Coulomb gauge) is: 



Ax = 0, 

Ay — Bq ^ sin kxX sinh kyy COS kzZ^ 

rZx rvy 

Az = — Bq-— sin kxX cosh kyy sin kzZ. 

kx 



In the polar basis, with the field given by Eqs. ([52|)-([54|), a possible vector potential is: 

A-f — 
Ae = 0, 
Az = -- 

However, note that this potential does not satisfy the Coulomb gauge condition. 



T 

Ar = IrJkzr) COS nO sin kzZ, 

m 



T 

Az — iLlkzv) cos 710 sin kzZ. 

2m 
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Fig. A.l: Interpretation of the vector potential in a two-dimensional magnetic field (i.e. a field that is independent 
of z). The change in the vector potential between P\ and P2 is equal to the flux of the magnetic field through the 
loop P1P2P3P4, divided by Az. 
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